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Abstract 

In this paper we show how to find nearly optimal embeddings of large trees in several natural 
classes of graphs. The size of the tree T can be as large as a constant fraction of the size of the graph 
G, and the maximum degree of T can be close to the minimum degree of G. For example, we prove 
that any graph of minimum degree d without 4-cycles contains every tree of size ecp and maximum 
degree at most d — 2ed — 2. As there exist d-regular graphs without 4-cycles of size O(d^), this 
result is optimal up to constant factors. We prove similar nearly tight results for graphs of given 
girth, graphs with no complete bipartite subgraph K^ t, random and certain pseudorandom graphs. 
These results are obtained using a simple and very natural randomized embedding algorithm, which 
can be viewed as a "self- avoiding tree- indexed random walk". 

1 Introduction 

We consider the problem of embedding a tree T in a given graph G. Formally, we look for an injective 
map / : V{T) V{G) which preserves the edges. We do not require that non-edges are mapped to 
non-edges, i.e. the copy of T in G need not be induced. Our goal is to find sufficient conditions on 
G in order to contain all trees of certain size, with maximum degree as large as a constant fraction 
(possibly approaching 1) of the minimum degree of G. 

1.1 Brief history 

The problem of embedding paths and trees in graphs has long been one of the fundamental questions 
in combinatorics. This problem has been extensively studied in extremal combinatorics, in the theory 
of random graphs, in connection with properties of expanders and with applications to Computer 
Science. The goal always has been to find a suitable property of a graph G which guarantees that it 
contains all possible trees with given parameters. We describe next several examples which we think 
are representative and give a good overview of previous research in this area. 



Extremal questions. The basic extremal question about trees is to determine the number of edges 
that a graph needs to have in order to contain all trees of given size. It is an old folklore result that 
a graph G of minimal degree d contains every tree T with d edges. This can be achieved simply by 
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embedding vertices of T greedily one by one. Since at most d vertices of G are occupied at any point, 
there is always enough room to embed another vertex of the tree. 

An old conjecture of Erdos and Sos says that average degree d is already sufficient to guarantee 
the same property. More precisely, any graph with more than (d— l)n/2 edges contains all trees with 
d edges. A clique of size d is an obvious tight example for this conjecture. The conjecture has been 
proved in several special cases, e.g. Brandt and Dobson ^ establish it for graphs of girth at least 5 
{girth is the length of the shortest cycle in a graph). In fact, they prove a stronger statement, that 
any such graph of minimum degree d/2 and maximum degree A contains all trees with d edges and 
maximum degree at most A. More generally, improving an earlier result of Luczak and Maxell |15j . 
Jiang proved that any graph of girth 2k + 1 and minimum degree d/k contains all trees with d edges 
and maximum degree at most d/k [12]. For general graphs, it has been announced by Ajtai, Komlos, 
Simonovits and Szemeredi [I] that they proved Erdos-Sos conjecture for all sufficiently large trees. 

A related statement, known as Loebl's (§ — § — §) conjecture [9], is that any graph on n vertices, 
with at least n/2 vertices of degree at least n/2, contains all trees with at most n/2 edges. Progress 
on this conjecture has been recently made by Yi Zhao [27j. Note that in the results discussed so far, 
the size of the tree is of the same order as degrees in the graph G. Without assuming any additional 
properties of G, this seems to be a natural barrier. 

Expanding graphs. Embedding trees of size much larger than the average degree of the graph is 
possible in graphs satisfying certain expansion properties. The first such result was established by Posa 
using his celebrated rotation/extension technique. Given a subset of vertices X of a graph G let N{X) 
denote the set of all neighbors of vertices of X in G. Posa [23J proved that if \N{X) \ X\ > 2\X\ — 1 for 
every subset X of G with at most t vertices, then G contains a path of length 3t — 2. This technique 
was extended to trees by Friedman and Pippenger [11]. They proved that if |A^(X)| > {d+ 'i-)\X\ for 
all subsets of size at most 2t — 2, then G contains every tree of size t and maximum degree at most 
d. The power of this technique is that while T can have degrees close to the minimum degree of G, 
it can be of size much larger than d, depending on the expansion guarantee. On the other hand, note 
that these techniques cannot embed trees of size larger than \G\/d, due to the nature of the expansion 
property. The result of Friedman and Pippenger has several interesting applications. For example, it 
can be used to show that for a fixed 5 > 0, d and every n there is a graph G with 0{n) edges that, even 
after deletion of all but 5\E{G)\ edges, continues to contain every tree with n vertices and maximum 
degree at most d. This has immediate corollaries in Ramsey Theory. 

The technique from [TT] also has an application for infinite graphs. For an infinite graph G, its 
Cheeger constant is h{G) = infx ^~^]xr~^' ^^ere X is a nonempty finite subset of vertices of G. Using 
the ideas of Friedman and Pippenger, one can show (see 0) that any infinite graph G with Cheeger 
constant d > 3 contains an infinite tree T with Cheeger constant d — 2. Benjamini and Schramm [5] 
prove a stronger result that any infinite graph with h{G) > contains an infinite tree with positive 
Cheeger constant. They use the notion of tree-indexed random walks to find such a tree. We will 
allude to this notion again later. 

Random and pseudorandom graphs. The random graph Gn,p is a probability space whose points 
are graphs on a fixed set of n vertices, where each pair of vertices forms an edge, randomly and 
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independently, with probability p. For random graphs, Erdos conjectured that with high probability, 
Gn,d/n foi' ^ fixed d contains a very long path, i.e., a path of length {l — a{d))n such that lim^^oo Oi{d) = 
0. This conjecture was proved by Ajtai, Komlos and Szemeredi [2] and, in a slightly weaker form, 
by Fernandez de la Vega [25]. Embedding trees, however, is considerably harder. Fernandez de la 
Vega |26j showed that there are (large) constants ai,a2 such that Gn^d/n contains any fixed tree T of 
size n/oi and maximum degree A < d/a2 w.h.p (i.e., with probability tending to 1 when n — > oo). 
Note that this is much weaker than containing all trees simultaneously, because a random graph can 
contain every fixed tree w.h.p, and still miss at least one tree w.h.p. Until recently, there was no result 
known on embedding all trees simultaneously. Alon, Krivelevich and Sudakov proved in [2] that for 
any e > 0, G^^d/n contains all trees of size (1 — e)n and maximum degree A such that 

d> — A3logAlog2(2/e). 

(All logarithms here and in the rest of this paper have natural base.) This result is nearly tight in 
terms of the size of T, and holds for all trees simultaneously. But it is achieved at the price of requiring 
that degrees in G are much larger than degrees in the tree. A similar result for pseudorandom graphs 
was also proved in [2]. A graph G is called an (n, d, A)-graph if G has n vertices, is d-regular (hence 
the largest eigenvalue of the adjacency matrix is d) and the second largest eigenvalue is A. Such graphs 
are known to have good expansion and other random-like properties. Alon, Krivelevich and Sudakov 
proved that any (n, d, A)-graph such that 

d ^ 160 . K/n - , X 

- > A^/^log 2/e 

A e 

contains all trees of size (1 — e)n and degrees bounded by A. Note that using the expansion properties 
of (n, d, A)-graphs, one could have used Friedman-Pippenger as well; however, one would not be able 
to embed trees larger than n/A in this way. 

Universal graphs. In a more general context, graphs containing all trees with given parameters 
can be seen as instances of universal graphs. For a family of graphs a graph G is called .F-universal, 
if it contains every member of .7-" as a subgraph. The construction of J'^-universal graphs for various 
families of subgraphs is important in applied areas such as VLSI design, data representation and 
parallel computing. For trees, a construction is known of a graph G on n vertices which contains all 
trees with n vertices and degrees bounded by d, such that the maximum degree in G is a function of 
d only [Tj. 

1.2 Our results 

We prove several results concerning embedding trees in graphs with no short cycles, graphs without 
a given complete bipartite subgraph, random graphs and also graphs satisfying a certain pseudoran- 
domness property. We embed trees with parameters very close to trivial upper bounds that cannot 
be exceeded: maximum degree close to the minimum degree of G, and size a constant fraction of the 
order of (or more precisely the minimum possible order of G under given conditions). A summary 



'^By the order of a graph, we mean the number of vertices. By size, we mean the number of edges. For trees, the two 
quantities differ only by 1. 
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of our main results follows. Here we assume that d and n are sufficiently large. 

1. For any constant k > 2, e < and any graph G of girth at least 2k + 1 and minimum degree 
d, G contains every tree T of size \T\ < \ed^ and maximum degree A < (1 — 2e)d — 2. 

2. For any G of minimum degree d, not containing Kg^t (a complete bipartite graph with parts of 
size s > t > 2), G contains every tree T of size |T| < g^^TT^myc? '-^ and maximum degree 

3. For a random graph Gn-p with d = pn > n^^^ for some constant /c, with high probability G^^p 
contains all trees of size 0{n/k) and maximum degree 0{d/k). 

It is easy to see that any graph of girth 2k + \ and minimum degree d has at least ^{d^) vertices. 
It is a major open question to determine the smallest possible order of such graph. For values of 
A; = 2, 3, 5 there are known constructions obtained by Erdos and Renyi [10] and Benson [6] of graphs 
of girth 2A; + 1, minimum degree d and order 0{d^). It is also widely believed that such constructions 
should be possible for all fixed k. This implies that our first statement is tight up to constant factors 
for = 2, 3, 5 and probably for all remaining k. Similarly, it is conjectured that for s > t there are 
i^s^t-free graphs with minimum degree d which have 0{d '-i) vertices. For s > {t — 1)!, such a 
construction was obtained by Alon, Ronyai and Szabo [3j (modifying the construction in [17]). Hence, 
the size of the trees we are embedding in our second result is tight up to constant factors as well. 
Finally, since the minimum degree of the random graph Gn,p is roughly pn, it is easy to see that for 
constant a > and p = n~" we are embedding trees whose size and maximum degree is proportional 
to the order and the minimum degree of Gn,p- Thus our third result is also nearly optimal. 

1.3 Discussion 

Local expansion. Using well known results from Extremal Graph theory, one can show that if 
graph G contains no subgraphs isomorphic to a fixed bipartite graph H (e.g., C2k or Kg^t) then it 
has certain expansion properties. More precisely, all small subsets of G have a large boundary. For 
example, if G is a G4-free graph with minimum degree d then all subsets of G of size at most d expand 
by a factor of Q{d). Otherwise we would get a 4-cycle by counting the number of edges between S 
and its boundary N{S) \ S. This simple observation appears to be a powerful tool in attacking various 
extremal problems and was used in [24j and [21] to resolve several conjectures about cycle lengths and 
clique-minors in H-fiee graphs. 

Therefore, it is natural to ask whether the expansion of H-hee graphs combined with the result 
of Friedman and Pippenger can be used to embed large trees. Recall that to embed a tree of size t of 
maximum degree d, Friedman and Pippenger require that sets of size up to 2t — 2 expand at least d+1 
times. For example, plugging this into the observation we made on the expansion of G4-free graphs 
only gives embedding of trees of order 0{d) in such graphs. This is quite far from the bound 0{d'^) 
which can be achieved using our approach. Similarly, in graphs of girth 2A; + 1, we can embed trees of 
size 0{d^), rather than 0{d^~^) as can be guaranteed by using Friedman-Pippenger. Therefore, our 
work can be seen as an extension of the embedding results for locally expanding graphs. It shows that 
using structural information about G, rather then just local expansion, one can embed in G trees of 
much larger size. 
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Extremal results. Our work sheds some light on why the Erdos-Sos conjecture, which we already 
discussed in the beginning of the introduction, becomes easier for graphs with no short cycles. This 
scenario was considered, e.g., in [8l[15l|12]. In particular, assuming that graph G has girth 2fe+l, k >2 
and minimum degree d, Jiang [12] showed how to embed in G all trees of size kd with degrees bounded 
by d. Although this is best possible, our result implies that this statement can be tight only for a 
relatively few very special trees, i.e., those that contain several large stars of degree d or extremely 
close to d. Indeed, if we relax the degree assumption and consider trees with the maximum degree at 
most (1 — e)d, then it is possible to embed trees of size 0{d'^) rather than 0{d). Moreover, a careful 
analysis of our proof shows that it still works for e which have order of magnitude fc^^^- Therefore 
even if we allow the degree of the tree to be as large as d — ck log d for some constant c, we are still 
able to embed all trees of size Q,{kd^"^ logd) S> kd. 

Random graphs. It is quite easy to prove an analog of the result of Fernandez de la Vega [26j on the 
embedding of a fixed tree of size proportional to n and maximum degree 0{pn) in the dense random 
graph Gn,p- Indeed for constant a < 1 and edge probability p = n~", this can be done greedily, vertex 
by vertex, generating the random graph simultaneously with the embedding. On the other hand, this 
simple approach cannot be used to embed all such trees with high probability, since there are too 
many trees to use the union bound. We provide the first result for simultaneous embedding of all 
trees of size 0(n) and maximum degree G(pn), in the random graph Gn,p for p = n~°' and constant 
a < 1. It is also interesting to compare our result with the work of Alon, Krivelevich, Sudakov [2]. 
They embed nearly spanning trees but with degree which is only a small power (roughly 1/3) of the 
degree of Gn,p- Although our trees are somewhat smaller (by constant factor), we can handle trees 
with degrees proportional to the minimum degree of the random graph. 

1.4 The algorithm 

All our results are proved using variants of the following very simple randomized embedding algorithm. 
First, choose arbitrarily some vertex r of T to be the root. Then for every other vertex u € V{T) there 
is a unique path in T from r to u. The neighbor of u on this path is called the parent of u and all the 
remaining neighbors of u are called children of u. The algorithms proceeds as follows. 

Algorithm 1. Start by embedding the root r at an arbitrary vertex f{r) £ V{G). As long as T is not 
completely embedded, take an arbitrary vertex u £ V{T) which is already embedded but its children are 
not. If f{u) has enough neighbors in G unoccupied by other vertices ofT, embed the children of u by 
choosing vertices uniformly at random from the available neighbors of f{u) and continue. Otherwise, 
fail. 

This algorithm can be seen as a variant of a tree-indexed random walk, i.e. a random process 
corresponding to a tree where each vertex assumes a random state depending only on the state of its 
parent. The notion of a tree- indexed random walk was first introduced and studied by Benjamini and 
Peres It is also used in the above mentioned paper of Benjamini and Schramm [5] to embed trees 
with a positive Cheeger constant into infinite expanding graphs. In our case, we consider in fact a 
self-avoiding tree-indexed random walk, where each state is chosen randomly, conditioned on being 
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distinct from previously chosen states. The corresponding concept for a random walk is a well studied 
subject in probability (see, e.g., [22j). Loosely speaking, we prove that our self-avoiding tree-indexed 
random walk behaves sufficiently randomly, in the sense that it does not intersect the neighborhood of 
any vertex more often than expected. To analyze the number of times the random process intersects 
a given neighborhood, we use large deviation inequalities for supermartingales. 

1.5 A supermartingale tail estimate 

In all our proofs, we use the following tail estimate. 

Proposition 1.1 Let Xi,X2, ■ ■ ■ ,Xn be random variables in [0, 1] such that for each k, 

I Xi,X2, . . . , Xk-i] < Uk- 
Let fi = X^ILi '^i- ^^6*^ /o*" (^ny < 6 < 1, 

n 2 

P[^X, >(l + ,5)/x]<e-V. 
i=l 

This can be derived easily from the proof of Theorem 3.12(b) in [14]. We re-state this theorem 
here: Let Yi,Y2, . . . ,Yn be a martingale difference sequence with —a^ < < 1 — ak for each k, for 
suitable constants 0^; and let a = a^. Then for any (5 > 0, 

n 2 

^ — ^ 5 an 

IP[2_, > San] < e ^{i+s/a) . 
fc=i 

A martingale difference sequence satisfies E[l^ | Yi, 12, . . . , = 0. However, it can be seen 
easily from the proof in [14J that for this one-sided tail estimate, it is sufficient to assume ^[Y^ \ 
Yi, Y2, . . . , Yi^i] < 0. (Such a random process is known as a supermartingale.) To show Proposition ll.H 
set Yk = Xk — ak and fj, = an = Y27=i ^k- The conditional expectations of X^ are bounded by o^, 
hence the conditional expectations of are non-positive as required. Since 5 < 1, we also replace 
2(1 -|- 5/3) by 3, and Proposition 11.11 follows. 

Note also that we can always replace /i by a larger value (e.g., by adding auxiliary random variables 
that are constants with probability 1), and the conclusion still holds. Hence, in Proposition 11.11 it is 
enough to assume Ylll=i '^i — Z^- 

2 Embedding trees in C4-free graphs 

The purpose of this section is to illustrate on a simple example the main ideas and techniques that we 
will use in our proofs. We start with C4-free graphs, which is a special case of two classes of graphs 
we are interested in: graphs without short cycles, and graphs without Kg^t (note that ^^2,2 = C^)- 

Let's recall Algorithm 1. For a given rooted tree T, we start by embedding the root r G V(T) 
at an arbitrary vertex /(r) G V{G). As long as T is not completely embedded, we take an arbitrary 
u G V{T) which is already embedded but its children are not. If f{u) has enough unoccupied neighbors 
in G, we embed the children of u uniformly at random in the available neighbors of f{u) and continue. 
Otherwise, we fail. 
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Theorem 2.1 Let e < 1/8, and let G he Cj^-free graph G of minimum degree at least d. For any tree 
T of size \T\ < ed'^ and maximum degree A < d — 2ed — 2, Algorithm 1 finds an embedding of T in G 
with high probability (i.e., with probability tending to 1 when d ^ oo). 

Example. Before we plunge into the proof, let us consider the statement of this theorem in a 
particular case, where G is the incidence graph of a finite projective plane. Let q = d — 1 he a prime 
or a prime power and consider a 3-dimensional vector space over the finite field ¥q. Let Vi be all 
2-dimensional linear subspaces of (lines in a projective plane), V2 all 1-dimensional linear subspaces 
(points in a projective plane) and two vertices from Vi and V2 are adjacent if their corresponding 
subspaces contain one another. This G has n = 2(g2 + g + 1) = 2{d^ - d + 1) vertices, it is bipartite 
and d-regular. Also, it is easy to see from the definition that G contains no C4. Clearly, we cannot 
embed in G trees of size larger than 0{d^) or maximum degree larger than d. In this respect, our 
theorem is tight up to constant factors. 

It is also worth mentioning that in the analysis of our simple algorithm, the trade-off between the 
size of T and the maximum degree A is close to being tight. Indeed, we show that for A = (1 — e)d, 
our algorithm cannot embed trees of size much larger than ed^. Suppose we are embedding a tree T 
of depth 3, where the degrees of the root and its children (level 1) are \fd. On level 2, the degrees are 
ed except one special vertex z of degree (1 — e)d. On level 3, there are only leaves. The size of this 
tree is ed^ + Q){d). 

We can assume that the root is embedded at a vertex corresponding to a point o. The level- 1 
vertices are embedded into a set Li of ^fd random lines through a. The level-2 vertices are embedded 
into a set P2 of d random points on these lines. Every point in the projective plane (except a) has 
the same probability of appearing in hence this probability is d/{d'^ — d) = l/{d — 1). The level-3 
vertices are embedded into random lines -L3 through points in each line through a point in P2 with 
probability e. Now every line has probability roughly e of being in L3, because one of its points on the 
average appears in P2. Consider the point where we embed the special vertex z and assume this is the 
last vertex we process in the algorithm. Each of the d lines through this point has probability roughly 
e of being occupied by a level-3 vertex, so on the average, only (1 — e)d lines are available to host the 
children of z. Therefore, our algorithm cannot succeed in embedding more than (1 — e)d children of z. 

Proof of Theorem 12.11 Let's fix an ordering in which the algorithm processes the vertices of T: 
V{T) = {1,2, ... , |y(T)|}. Here, 1 denotes the root and the ordering is consistent with the structure 
of the tree in the sense that every vertex can appear only after its parent. In step 0, the algorithm 
embeds the root. In step i, the children of t are embedded randomly in the yet unoccupied neighbors 
of f{t) G V{G). If t is a leaf in T, the algorithm is idle in step t. 

Our goal is to argue that for large d, with high probability, the algorithm never fails. The only 
way the algorithm can fail is that for a vertex t G y{T), embedded at f = f{t) € V{G), we are not 
able to place its children since too many neighbors of f in G have been occupied by other vertices of 
T. This is the crucial "bad event" we have to analyze: 

Let By denote the event that at some point, more than 2ed + 2 neighbors of v are occupied by 
vertices of T other than the children of f~^{v). 

If we can show that with high probability, does not occur for any v £ V{G), then the algorithm 
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clearly succeeds. To do this, we will modify our algorithm slightly and force it to stop immediately 
at the moment when the first bad event occurs. Thus, in analyzing B^, we can assume that for any 
w ^ V the event B^, has not happened yet. 

Our strategy is to prove that the probability of B^ for any given vertex v, even conditioned on 
our embedding getting "dangerously close" to v, is exponentially small in d. Then, we argue that 
the number of vertices which can ever get dangerously close to our embedding (i.e., the number of 
bad events we have to worry about) is only polynomial in d. Therefore, we conclude that with high 
probability, no bad event occurs. 

Lemma 2.2 Let e < ^ and d > 24. For a vertex v G V{G), condition on any history 7i of running 
the algorithm up to a certain point such that at most 2 vertices of T have been embedded in N(v). 
Then 

^B^ I n] < e-"^l^^. 

Proof. For t = 1, 2, . . . , |y(r)|, let Xt be an indicator variable of the event that /(t) ^ v but 
some child of t gets embedded in N{v). Here we use the property that G is C4-free. Note that, if 
f{t) = w V, w can have at most one neighbor in N{v), otherwise we get a 4-cycle. Therefore, t can 
have at most one child embedded in N{v) and Xt represents the number of vertices in N{v), occupied 
by the children of t. 

We condition on a history 7i of running the algorithm up to step h, such that at most 2 vertices of 

ITI 

N{v) have been occupied so far. The bad event By can occur only ifX = J2tJh+i > 2ed. Therefore, 
our goal is to prove that this happens only with very small probability. 

Each vertex chooses the embedding of its children randomly, out of at least d — 2ed — 2 still available 
choices (here we assume that no bad event occurred before By for any w v, oi else the algorithm 
has failed already). Thus we get 

^ ' - d-2ed-2 - 2d/3 

where dxit) is the number of children of the vertex t in T. We also used e < 1/8 and d > 24. This 
holds even conditioned on any previous history of the algorithm, since the decisions for each vertex 

ITI 

are made independently. We are interested in the probability that X = Yl[=h+i -^t exceeds 2ed. Using 
the fact that X^^g-p dxit) = \T\ — 1 < ed"^, we can bound the expectation of X by 



t=h+l t&T ' 

We use the supermartingale tail estimate (Proposition II. ip with S = ^ and = |ed: 

F[X > 2ed] < e~^'^l'^ = e'^/^T ^ g-ed/is^ 

Therefore, the bad event By happens with probability at most . □ 

Our final goal is to argue that with high probability, no bad event By occurs for any vertex 
V G V{G). Since the number of vertices could be potentially unbounded by any function of d, we 



8 



cannot apply a straightforward union bound over all vertices in the graph. However, we observe that 
the number of vertices for which can potentially occur is not very large. 

Define P„ to be the event that at some point in the algorithm, two vertices in N(v) are occupied 
by vertices of T. This is the event that the embedding of T gets "dangerously close" to v. Observe 
that if is "witnessed" by the pair of vertices of T which are placed in N{v), each pair of vertices of 
T can witness at most one event "Dy (otherwise the same pair is in the neighborhood of two vertices 
which implies a C4). Since T has at most ed? vertices, the event can occur for at most e^d^ vertices 
in any given run of the algorithm. 

Clearly, event C P^. Let's analyze the probability of By, conditioned on T>y. The event T>y can 
be written as a union of all histories 7i of running the algorithm up to the point where two vertices of 
T get embedded in N{v). By Lemma 12.21 

F[By I n] < e-"^/^^ 

for any such history TC. By taking the union of all these histories, we get 

F[By I Vy] < e-"^/^^ 

Now we can estimate the probability that By ever occurs for any vertex v: 

¥[3v G V; By occurs] < ^ F[By] = ^ F[By \ Vy]F[Vy] < e'''^/^^ ^ F[Vy]. 

Since Vy can occur for at most e^d^ vertices in any given run of the algorithm, we have X^„gy IP[Pd] < 
e^d^. Thus 

F[3v E V;By occurs] < e^d^e"""'/^^ ^ 0, 
when d ^ 00. Hence the algorithm succeeds with high probability. □ 

3 Embedding trees in rfree graphs 

Next, we consider the case of graphs which contain no complete bipartite subgraph Kg t with parts 
of size s and t. We assume that s > t. It is known that the extremal size of such graphs depends 
essentially only on the value of the smaller parameter t. Indeed, by the result of Kovari, Sos and Turan 
|18j the number of vertices in /Cg^t-free graph with minimum degree d is at least cd*/(*~^\ where only 
the constant c depends on s. For relatively high values of s (s > (t — 1)!) there are known constructions 
(see, e.g., [HI [3]) of K^.t-free graphs achieving this bound. Moreover, it is conjectured that 0((i*/(*~^)) 
is the correct bound for all s > t. This implies that one cannot embed trees larger than 0((i*/'^*^^-*) 
in a Kg^t-^ree graph with minimum degree d. Also, it is obvious that the maximum degree in the 
tree should be 0(d). In this section we show how to embed trees with parameters very close to these 
natural bounds that cannot be exceeded. It is easier to analyze our algorithms in the case when the 
maximum degrees in the tree are in fact bounded by 0{d/t). First, we obtain this weaker result, 
and then present a more involved analysis which shows that our algorithm also works for trees with 
maximum degree at most ^d. Our algorithm here is a slight modification of Algorithm 1. 
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Algorithm 2. For each vertex v € V{G), fix a set of d neighbors A^+(f ) ^ N{v). Start by embedding 
the root of the tree r & T at an arbitrary vertex f{r) G V{G). As long as T is not completely embedded, 
take an arbitrary vertex u G V{T) which is already embedded but its children are not. If f{u) has 
enough neighbors in N^{f{u)) unoccupied by other vertices of T, embed the children of u one by 
one, by choosing vertices uniformly at random from the available vertices in N-^{f{u)), and continue. 
Otherwise, fail. 

The only difference from the original algorithm is that when embedding the children of a vertex, we 
choose from a predetermined set of d neighbors rather than all possible neighbors. Since the maximum 
degree of G can be very large, this modification is useful in the analysis of our algorithm. It allows us 
to bound the number of dangerous events. However, we believe that the original algorithm works as 
well and only our proof requires this modification. 

Theorem 3.1 Let G be a Kg^t-ff^^ graph (s > t) with minimum degree d. For any tree T of size 
\T\ < ^s''^/^*~^^dl^f^*~^^ and maximum degree A < -^d, Algorithm 2 finds an embedding of T in G 
with high probability. 

Proof. We follow the strategy of defining bad events for each vertex v G y{G) and bounding the 
probability that any such event occurs. 

Let By denote the event that at some stage of the algorithm, more than ^d + 2t vertices in N+{v) 
are occupied by vertices ofT other than children of f~^{v). 

Note that (as in the previous section), to bound the probability of a bad event, we assume that 
our algorithm stops immediately at the moment when the first such event occurs. To simplify our 
analysis, we also assume that the children of every vertex of T are embedded in some particular order, 
one by one. As long as By does not occur, we have at least ^d — 2t unoccupied vertices in N+{v). 
Since degrees in the tree are bounded by < ^d, we have enough space for the children of any 
vertex to be embedded at N-^-{v). As we embed the children one by one, the last child still has at least 
^d — 2t — -^d > ^d choices available (for large enough d). 

The new complication here is that another vertex w could share many neighbors with v. Unlike in 
the case of K2,2-free graphs, where any two vertices can share at most 1 neighbor, in i^s^^-free graphs 
(for s > t > 2), we do not have any bound on the number of shared neighbors. Therefore we have to 
proceed more carefully. For every vertex v in G, we partition all other vertices into two sets depending 
on how many neighbors they have in N^{v): 

• Ly = {w^v: \N+{v) n N+(w)\ < 2st^d^}. 

• My = {w^v: \N+{v) n N+{w)\ > 2s^^dT^}. 

The idea is that vertices in L„ are harmless because the fraction of their children that affects N^(v) 
is 0((i~^/*^*~^)). Since the trees we are embedding have size ©(d^+^Z^*"^)), we show that the expected 
impact of these children on N-^,{v) is 0(d). 

The vertices in My have to be treated in a different way, because the fraction of their children in 
N^{v) could be very large. However, we prove that the total number of edges between My and N^{v) 
cannot be too large, otherwise we would get a copy of Kg^t in G. Therefore, the impact of the children 
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of My on N^(v) can be also controlled. Again, we "start watching" a bad event for vertex v only at 
the moment when it becomes dangerous. 

Let denote the event that at least t vertices in N+{v) are occupied by vertices of tree T other 
than children of f~^{v). 

Lemma 3.2 Let 7i be a fixed history of running the algorithm up to a point where at most t vertices 
in N^(v) are occupied. Conditioned on TC, the probability that children of vertices embedded in L^, 
will ever occupy more than jd + t vertices in N^{v) is at most e~'^l'^^ . 

Proof. We use an argument similar to the proof of Lemma 12. 2i Fix an ordering of the vertices 
of T starting from the root, i = 1, 2, . . . , |r|, as they are processed by the algorithm. Suppose that 
vertices 1, . . . ,h were embedded during the history Ti.. Let Xi be the indicator variable of the event 
that i G T is embedded in N^{v) and the parent of i was embedded in L^. As long as the algorithm 
does not fail (i.e., no bad event happened), for each vertex i £ T when it is embedded we have at 
least d — ^d — 2t — ^d > jd choices where to place the vertex. This holds even if we condition on any 
fixed embedding of vertices j < i. Moreover, the embedding decisions for different vertices are done 
independently. Since we assume that the parent of i was embedded in L^, at most 2s^/*^*~"'^^(i*^*~^)/*^*~^) 
of these choices are in N^{v). Therefore, conditioned on any previous history TC such that i was not 
embedded yet 



1 t~2 . 

\x, = i\n]<-' 



t-i 

Summing up over such vertices i in the tree, whose number is at most |T| < ^s~^^^^~^^c[''^^^~^\ we 
have 

^1 -- - -■ --.d. 

t=h+l i=h+l 

Since, the upper bound on P[Xj = 1 | "H] is still valid even if we also condition on a fixed embedding 
of all vertices j < i, by Proposition 11.11 with ji = hd and 6 = 1, 



[ 5^ = ^ nx^ = l\'H]<\T\.sQ'-' <\d. 



\T\ 



4 
=/i+i 



< e 



-d/2i 



By definition of during the first h steps of the algorithm only at most t vertices in N^{v) have 
been occupied. Therefore, the probability that more than jd + t vertices are ever occupied is at most 

g-d/24_ □ 

Next, we treat the vertices whose parent is embedded in M^. Recall that each vertex in My has 
many neighbors in N^{v). However, the number of edges between and A^+(w) cannot be too large. 
Observe that there is no Kg^t-i in G with s vertices in A^+(f) and t — 1 vertices in Mj,, otherwise we 
would obtain a copy of Kg^t by adding v to the part of size t — \. Also, this shows that for t = 2, 
My must be empty. Indeed, by definition any vertex in M^ has at least 2s neighbors in Nj^{v), which 
together with vertex v would form K2s,2- So in the following, we can assume s > i > 3. The following 
is a standard estimate in extremal graph theory, whose short proof we include here for the sake of 
completeness. 
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Lemma 3.3 Consider a subgraph containing the edges between and N^(v), where |A^+(f )| = d, 
every vertex in has at least 2s^^^^~^^ d^^~'^^^^^~^^ neighbors in N^{v) and the graph does not contain 
Kg^t-i (with s vertices in N+{v) and t — 1 vertices in My). Then has at most 2td edges. 

Proof. Let m denote the number of edges in and assume m > 2td. Let N denote the number of 
copies of Ki^t-i (a star with t — 1 edges) in Hy, with 1 vertex in N^(v) and t — 1 vertices in M„. By 
convexity, the minimum number of Ki t-i in is attained when ah vertices in Nj^{v) have the same 
degree m/\Nj^{v)\. Therefore 



(m \ / m 



,t - 1, 

Our assumption that m > 2td implies that ^, ^ — 1, . . . , ^ — (t — 2) > ^ and therefore 



N > d 



(m\t-l 
\2d> 



m 



(t-1)! (t-l)!2*-id*-2- 



Since all the degrees in My are at least 2s^/(*-^)(i(*~2)/(*~^\ we have m > 2s^/(*"^)d(*"2)/(*"^) |M^| 
Then m*"^ > 2^-^^ sd^-^\My\*-^ and 

^ s\My\'-^ ^ f\My\ 

- (t - l)!2*-id*-2 - (t-1)! -\t-l 

Consequently, there must be a (t — l)-tuple in My which appears in at least s copies of Ki^t-i- This 
creates a copy of Kg^t-i, a contradiction. □ 

Lemma 3.4 Let TC be a fixed history of running the algorithm up to a point where at most t vertices 
in N^(v) are occupied. Then, conditioned on TC, the probability that children of vertices embedded in 
My will ever occupy more than jd + t vertices in N^{v) is at most tVde~^^ . 

Proof. As we mentioned, we can assume s > t > 3, otherwise My is empty. Consider the vertices in 
My and for every w G My denote the number of edges from w to -/V+(?^) by dyj. We know that each 
vertex w G My has dyj > 2s^/(*~^)(i(*~2)/(*^i) > 2\/d (using t > 3). From Lemma [3^ we know that 
the total number of these edges is X^^gjv^^ diy < 2td. This implies that \My \ < 2td/{2^fd) < t\/d. 

For w £ My, let Xyj denote the number of tree vertices embedded in N-^{v) after the history Ti., 
whose parent is embedded at w. We claim that with high probability, < ^d^. This can be seen 
as follows. Suppose that f{x) = w for some x G V{T). The degree of x in T is at most ^d and the 
children of x are embedded one by one. Hence as we already explained, if no bad event By, happened 
so far, each child y has at least jd choices available for its embedding. Therefore, even conditioned 
on the embedding of the previous children, the probability that y is embedded in N^{v) is at most 
p = min{l, dy;/{\d)}. So satisfies the conditions of Proposition [TTT] with ^ = -^d-dy, / {\d) = -^dy,. 
By Proposition 11.11 with (5 = 1, 



[Xy, > —dy,] < e-^l^ = e-^t<i^ < e-^t^, 
8t 
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using d.^ >2y/d. By the union bound, the probabihty that Xyj > -^d^ for any w G is at most 
|M^|e~23tV^ < t^e"2it^. Otherwise, 

V X^<— V dn,<—-2td = -d. 

8t 8t 4 

Together with the t vertices possibly occupied within history H, this gives at most jd + t vertices 
occupied in N^{v). □ 
Having finished all the necessary preparations we are now ready to complete the proof of Theo- 
rem [XTJ The bad event can occur only if more than jd + t vertices are occupied in N^{v) by 
children of vertices in or more than jd + t vertices by children of vertices in M^. As we proved, 
each of these events has probability smaller than t^/de~^^^'^^^\ therefore the probability of is at 
most 

2tVde-^/(24t)^ This holds even if we condition on the event Vy (a disjoint union of histories Ti.) 
which occurs at the moment when t vertices in N-^{v) are occupied. 

Let's estimate the number of events T>^ which can occur. The event is witnessed by a f-tuple of 
vertices of tree T which are embedded in N^{v). The same i-tuple cannot be a witness to s different 
events Vy, because then we would have a copy Ks^t in our graph G. Therefore, each t-tuple can witness 
at most s — 1 events and the total number of events Vy is bounded by {s — 1)\T\* < sd'^*. Since Vy can 
occur for at most sd^* vertices in any given run of the algorithm, we have X^^gy < sd^*. Thus 

F[3v G V;By occurs] < ^ F[By] = ^ F[By \ Vy]F[Vy] 

< 2tVde-^t^"^F[Vy] <2std'^^+he-^t^ 

which tends to as d ^ oo. □ 

Finally, we show how to prove the same result for trees whose degrees can be a constant fraction 
of d, independent of t. The following is a strengthened version of Theorem 13.11 

Theorem 3.5 Let G be i^s_t-/ree graph G (s > t) of minimum degree d. For any tree T of size 
\T\ < ^s^^^^^~^^ d^^^^~^^ and maximum degree A < ^^d, Algorithm 2 finds an embedding of T in G 
with high probability. 

Proof. The proof is very similar to the proof of Theorem 13.11 with some additional ingredients. We 
can assume that t > 5, otherwise the result follows from Theorem 13.11 directlv. We focus on the new 
issues arising from the fact that degrees in the tree can exceed 0{d/t). For a fixed vertex consider 
again the set My defined by 

My = {w^v: \N^{v) n N+{w)\ > 2s't^d't^}. 

We know from Lemma [3.3l that the number of edges from My to N^{v) is bounded by 2td. Before, 
we argued that since degrees are bounded by 0{d/t), the expected contribution of vertices embedded 
along edges from My to N^{v) cannot be too large. The vertices in T that could cause trouble are 
those embedded in My, whose degree is more than 0{d/t). The contribution of the children of these 
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vertices to iV+(u) might be too large. Hence we need to argue that not too many vertices of this type 
can be embedded in M^. 

First, observe that using Lemma [331 and the definition of My, the size of My is bounded by 

,,,, e(My,N+(v)) 2td i 

2s'-idt-i 2s*-i(i*-i 

Similarly, if we denote by Q the vertices of T with degrees at least ^d, the number of such vertices 
is bounded by 

2\T\ ^d^ , 1 
\Q\<^<^^^ = 2td—K 

64t mt 

Our goal is to prove that not many vertices from Q can be embedded in My. For that purpose, we 
also need to define a new type of "bad event" Cy and "dangerous event" £y. 

The event £y occurs if any vertex of the tree is embedded in My . The event Cy occurs if after the 
first vertex embedded in My, at least 8 vertices from Q are embedded in My. 

Now, consider any tree vertex g G Q. At the moment when we embed q, there are at least jd choices, 

1 

unless Buj happened for some vertex w and the algorithm has failed already. Since \My\ < td*-^ , the 
probability of embedding q into My, even conditioned on any previous history TC', is 

M„| itd'^ At 



nm ^ M, I «'] < 1^ < _ < _ 



for t > 5. We condition on any history TC up to the first vertex embedded in My, and estimate the 
probability that at least 8 vertices from Q are embedded in My after this moment. For any particular 
8-tuple from Q, this probability is bounded by (4t/(i^/^)^ = (4t)^/(i^. The number of possible 8-tuples 
in Q is at most |Q|^ < (2tfi^/(*-^))S < {2t)^d^ for t > 5. Hence, 

I n] < ^^{2tfd^ = 

By averaging over all histories up to the moment when the first vertex is embedded in My, we get 

^Cy I £y] < 88*16/^1 

Consider the number of events £y that can ever happen. For any event £y, there is a witness vertex 
X G V{T), mapped to f{x) = w £ My. Observe that the definition oi w £ My is symmetric with 
respect to {v,w), i.e., we also have v G M^. We know that |Mu,| < td^^^^~^^ for any w gV, therefore 
each vertex of the tree can be witness to at most td'^^^^~^^ events £y. In total, we can have at most 
\T\ ■ td^^^^~^^ < (i*/^*""'^^ • td^^^^~^^ < td^ events £y. Since £y can occur for at most td^ vertices in any 
given run of the algorithm, we have Y.vev^[^v\ < td^ ■ Hence, 



¥[3v G V; Cy occurs] < ^ P[C^,] = ^ P[C^, | £y]¥[£y 



which tends to for d — > cxd. So, with high probability, no event Cy happens. 
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Given that does not occur for any vertex, we can carry out the same analysis we used to prove 
Theorem 13. 11 The only difference is that each vertex v might have up to 9 vertices from Q embedded 
in My (8 plus the first vertex ever embedded in M^). Since the degrees in T are bounded by even 
if the children of these vertices were embedded arbitrarily, still they can occupy at most vertices 
in Nj^{v). The number of vertices in A^+(f ) occupied through vertices in Li, or the contribution of the 
children of vertices in T with degree 0{d/t) that were embedded in can be analyzed just like in 
Theorem 13.11 Thus, with high probability, at most \d+ i^d + 2t < jd vertices are occupied in any 
neighborhood and so at least ^d vertices are always available to embed any vertex of the tree. □ 

4 Graphs of fixed girth 

In this section we consider the problem of embedding trees into graphs which have no cycle of length 
shorter than 2k + 1 for some k > 1. (If the shortest cycle in a graph has length 2A; + 1, such a graph 
is said to have girth 2k + 1.) We also assume that the minimum degree in our graph is at least d. 
It is easy to see that such G must have Q{d^) vertices, because up to distance k from any vertex v, 
G looks locally like a tree. It is widely believed that graphs of minimum degree d, girth 2A; + 1, and 
order 0{d^) do exist for all fixed k and large d. Such constructions are known when k = 2,3 and 5. 
Since our graph might have order 0{d^), we cannot aspire to embed trees of size larger than 0{d^) 
in G. This is what we achieve. For the purpose of analysis, we need to modify slightly our previous 
algorithms. 

Algorithm 3. For each v ^ V , fix a set of its d neighbors N^{v). Assume that T is a rooted tree 
with root r. Start by making k random moves from an arbitrary vertex vi £ V , in each step choosing 
a random neighbor Vi-^i G N^{vi). Embed the root of the tree at f{r) = v^- 

As long as T is not completely embedded, take an arbitrary vertex s G y{T) which is embedded but 
its children are not. If f{s) has enough available neighbors in N^{f{s)) unoccupied by other vertices 
of T , embed the children of s among these vertices uniformly at random. Otherwise, fail. 

The following is our main result for graphs of girth 2k +\. 

Theorem 4.1 Let G be a graph of minimum degree d and girth 2k + \. Then for any constant e < 
Algorithm 3 succeeds with high probability in embedding any tree T of size jed^ and maximum degree 
A(T) <d-2ed-2. 

To prove this theorem, we will generalize the analysis of the C4-free case to allow embedding of 
substantially larger trees. The solution is to consider multiple levels of neighborhoods for each vertex. 
Starting from any vertex v € V{G), we have the property that up to distance k from v, G looks like 
a tree (otherwise we get a cycle of length at most 2k). Consequently, for any vertex w, there can be 
at most one path of length k from w to v . Therefore, embedding a subtree whose root is placed at w 
cannot impact the neighborhood of v too much. 

In fact, neighbors to be used in the embedding are chosen only from a subset of d neighbors N^{v). 
We can define an orientation of G where each vertex has out-degree exactly d, by orienting all edges 
from V to N^{v). (Some edges can be oriented both ways.) Then, branches of the tree T are embedded 
along directed paths in G. 
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Definition 4.2 For a rooted tree T , with a natural top-to-bottom orientation, let Lk-i{x) define the 
set of descendants k — 1 levels down from x £ V{T). 

For a tree vertex x G V{T), denote by Xy^^ the number of vertices in Li^_i(x) that end up embedded 
in N^{v), before the children of f~^{v) are embedded. 

For a vertex v G V{G), denote by X„ the total number of vertices in T that end up embedded in 
Nj^iv), before the children of f~^{v) are embedded. 

We extend T to a larger rooted tree T* by adding a path of length k — 1 above the root of 
T and making the endpoint of this path the root of T*. Observe that our embedding algorithm 

proceeds effectively as if embedding T* , except the first k — 1 steps do not occupy any vertices 
of G. Each embedded vertex y G V(T) is a {k — l)-descendant of some x £ V{T*) and hence 
V{T) = \JxeV{T*) -^fe-i(^)- summing up the contributions over x G ^(r*), we get 

xeV{T*) 

Our goal is to apply tail estimates on X^ in order to bound the probabilities of "bad events" . Just 
like before, we need to be careful in summing up these probabilities, since the size of the graph might 
be too large for a union bound. We start "watching out" for the bad event only after a "dangerous 
event" occurs. We also stop our algorithm immediately after the first bad event happens. 

Event By occurs when X^ > 2ed + 2. Event Vy occurs whenever at least two vertices in N^(v) can 
be reached by directed paths of length at most k — 1, avoiding v, from the embedding ofT*. By the 
embedding ofT*, we also mean the vertices visited in the first k — 1 steps of the algorithm, which are 
not really occupied. 

Suppose qi,q2 are the first two vertices in N^{v) that can be reached by directed paths of length 
at most k — 1, avoiding v, from the embedding of T* . Then we define a modified random variable 
Xy^x as the number of vertices in which are embedded in N^{v) \ {qi,q2}, but not through 

V itself. In other words, these random variables count the vertices occupied in 7V+(u), not counting qi 
and q2. Observe that Xy < '}2xeV{T*) ^v,x + 2. 

Lemma 4.3 Assume the girth of G is at least 2k + 1. Fix an ordering of the vertices ofT* starting 
from the root, {xi,X2,x^, . . .), as they are processed by the algorithm. Let Ti. be a fixed history of 
running the algorithm until two vertices qi,q2 € N^{v) can be reached from an embedded vertex by a 
directed path (avoiding v) of length at most k — 1. Then for any vertex Xi G V{T*), Xy^^i is a 0/1 
random variable such that 

^\-^V,Xi — 1 I ^1 -^V,Xl ) -^V,X2 ) • • • ) -^V,Xi-l] — 



{d-2ed-2f-^' 

Proof. First, note that any vertex xi embedded during the history H has Xy^^i = 0. (Since the only 
vertices in N^(v) possibly reachable within k — 1 steps from f{xj) are qi and 52-) Therefore we can 
assume that the embedding of .Xj together with the embedding of the subtree of its descendants in 
T* is still undecided at the end of TC. Let /C denote the event that Xi is embedded so that there is a 
directed path of length exactly k — 1 from /(xj) to N^{v), which avoids v and has endpoint in N^{v) 
other than qi,q2- Observe that this is the only way Xy^^i could be non-zero. Indeed, if Xy^^i = !> 
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then there is a branch of tree T* of length k — 1 from Xi to some y that was mapped to a path from 
f{xi) to Nj^{v) such that the vertex next to last is not v. However, such a path from /(xj) to Nj^{v), 
if it exists, is unique. If we had two different paths like this, we could extend them to two paths of 
length k between /(xj) and v, which contradicts the girth assumption. Note that fC occurs only if this 
unique path leads to a vertex of N^{v) other than qi or 52- Also, we have that at most one vertex 
y G Lk-i{xi) can be embedded in N^{v). The variable X^^^^. is equal to 1 when this happens for some 
y £ Lk-i{xi), and otherwise. 

We bound the probability that X^^^i = 1, conditioned on {TC, X^^^i, ■ ■ ■ , X^^xi^i)- In fact, let's 
condition even more strongly on a fixed embedding £ of all vertices of T except for the descendants of 
Xi. We also assume that £ satisfies /C, i.e. /(xj) is at distance exactly k — 1 from N+{v), since otherwise 
Xv,Xi = 0. We claim that any such embedding implies the values of X^^^i , ■ ■ ■ , X^^x^^i ■ For vertices Xj 
such that does not intersect the subtree of Xi, this is clear because the embedding of these 

vertices is fixed. However, even if Lfc_i(xj) intersects the subtree of Xj, X^^Xj is still determined, since 
none of these vertices can be embedded into N^{v). Indeed, any descendant of Xi which is in Lk-i{xj) 
must be also in Lk>{xi) for some k' < k — 1. If the embedding of Lk'{xi) intersects N^{v), we obtain 
that there are two paths from f{xi) to v, one of length k and another of length k' + 1 < k. Together 
they form a cycle of length shorter than girth, a contradiction. 

Now fix a vertex y G Lk-i{xi). Every vertex Xj G T*, when embedded, chooses randomly from 
one of the available neighbors of the vertex of G, in which its parent has been embedded. As long 
as no bad event happened so far (otherwise the algorithm would have terminated), there are at least 
d — 2ed — 2 candidates available for f{xj). Therefore, each particular vertex has probability at most 
l/{d — 2ed — 2) of being chosen to be f{xj). The probability that f{y) £ X^{v) is the probability that 
our embedding follows a particular path of length k — 1. By the above discussion, this probability is 
at most - 2e(i - 2)'=-^ (Note that by our conditioning, this path might be already blocked by 
the placement of other vertices; in such a case, the probability is actually 0.) Using the union bound, 
we have 

^ ' * ' ^ - (d - 2e(i - 2)'=-! 
Since the right hand side of this inequality is a constant, independent of the embedding, we get the same 
bound conditioned on {7i,Xy^xi, ■ ■ ■ > -'^ij.Xi.i , ^) and hence also conditioned on {7i,Xv^xi, ■ ■ ■ ,Xv^Xi-i)- 
□ 

Now we are ready to use our supermartingale tail estimate from Proposition 11.11 to bound the 
probability of a bad event. 

Lemma 4.4 Assume e < ^ and \T\ < \ed^ . For any vertex v £ V{G), condition on the dangerous 
event Vy. Then for large enough d, the probability that the bad event By happens is 



Vy] < e 



-ed/3 



Proof. The bad event means that Xy > 2ed + 2. As before, first we condition on any history 7i up to 
the point when T>y happens. At this point, two vertices qi,q2 G Xj^{v) are within distance A; — 1 of the 
embedding of T* constructed so far. We consider these two vertices effectively occupied. Our goal is 

IT* I ~ 

to prove that the number of additional occupied vertices in A^+(f ) is small, namely X]i=i -^v,Xi ^ 2ed. 



17 



By Lemma 14.31 we know that 

\Lk-i{xi 



((i-2ed-2)*^-i' 
Therefore the expectation of Xy = Xll^i -^v^Xi is bounded by 

Here we used that e < d large enough, and \T\ < \ed^ . So we can set ^ = ed, 5 = \ and use 
Proposition 1 1 . 1 1 to conclude that, 

P[X„ > 2ed I n] < e-"^/^. 

The same holds when we condition on the event P^, which is the disjoint union of all such histories 
7i. Consequently, < Xy + 2 < 2ed + 2 with high probability, which concludes the proof. □ 

To finish the proof of Theorem 14. 1^ we show that with high probability. By does not happen for 
any vertex v £ V. First, let's examine how many events can possibly occur for a given run of the 
algorithm. Every vertex v for which happens has a "witness pair" of vertices in N^(v) satisfying 
the condition that they can be reached by directed paths of length at most k — 1 from the embedding 
of T*. The number of such vertices is at most \T*\d^^^ < d^^ . Also, observe that the same pair can 
be a witness to at most 1 event P^,, otherwise we have a 4-cycle in G which contradicts the high girth 
property. Hence the number of possible witness pairs is at most 

2 



and each event T)^ has a unique witness pair. Therefore, the expected number of events is 

V 

Now we bound the probability that any bad event Bjj occurs. 

F[3v G V; By occurs] < ^ r[By] = ^ F[By \ Vy]F[Vy] 

For a constant k and d oo, this probability tends to 0. □ 



5 Random graphs and the property V{d^k^t) 

The main objective of this section is to obtain nearly optimal tree embedding results for random 
graphs. In our analysis, we do not actually require true randomness. The important condition that G 
has to satisfy is a certain "pseudorandomness" property, stated below. Roughly speaking, the property 
requires that there are not too many paths between any pair of vertices, compared to how many paths 
a random graph would have. 
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Property V{d,k,t). Let d,k and t be positive integers. A graph G on n vertices satisfies property 
V{d,k,t) if 

1. G has minimum degree at least d. 

2. For any u,v G V, the number of paths of length k from u to f is 

Pk{u,v)<d^/\ 

3. For any u,v G V, the number of paths of length k + 1 from u to u is 

Pk+iiu,v) < — 

Remark. In the second condition, d^^^ is somewhat arbitrary. For k constant, it would be enough 
to require Pk{u,v) = o{d/ log d). However, having a larger gap between Pk{u,v) and d allows our 
framework to work for larger (non-constant) values of k. 

Observe that d-regular graphs of girth 2k + 1 satisfy V{d, k,t = d^)^ because there is at most one 
path of length k between any pair of vertices. Thus our embedding results for graphs satisfying this 
property implies similar statements for regular graphs of fixed girth, although somewhat weaker than 
those we presented in Section 21 Our main focus in this section is on random graphs. 

Proposition 5.1 A random graph Gn,p where ^ > p > n°-~^, a > constant, satisfies almost surely 
V{d, k, t) with t = {1 — o{l))n, d = {1 — o{l))pn and k > 1 chosen so that 

^(pn)-3/4 < p'^n'^-i < ^{pn)^/\ 

Proof. Since we assume > n"", we have k < l + l/a, otherwise p'^n'^""'^ = p{pn)^~^ > pn >> (pn)^^^ 
contradicting our choice of k. Hence, A; is a constant. 

The degree of every vertex in Gn,p is a binomially distributed random variable with parameters n 
and p. Thus, by standard tail estimates (Chernoff bounds), the probability that it is smaller than 

d = pn — ^/pnlog n = (1 — o{l))pn 

is 6"^*-^°^ = o(l/n). Therefore with high probability the minimum degree of Gn,p is at least d. 
The expected number of paths of length k from u to v is 

E[Pkiu,v)] < p^n^-^ < ^{pnfl^ 

by our choice of k. We use the Kim-Vu inequality [16] to argue that Pk{u,v) is strongly concentrated. 
Let te be the indicator variable of edge e. We can write 

P eeP 
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where P runs over all possible paths of length k between u and v. Clearly, this is a multilinear 
polynomial of degree k. Let -^Pk{u,v) denote the partial derivative of Pk{u,v) with respect to all 
variables in the set /. Using the notation of [16] , we set 



maxE 
\i\=i 



E = maxj>o Ei and E' = maxj>i Ei. In particular, Eq is the expected value of Pk{u, v). The Kim-Vu 
inequality states that 

¥[\Pkiu,v) - Eo\ > akX^VWE] = 0(e-^+('=-i)i°g") 



for any A > 1 and = S^vM- In our case, E 



can be seen as the expected number 



of u-v paths of length k with i edges already fixed to be on the path. For any choice of such i 
edges, if i < /c, we have at most n'^"*"^ choices to complete the path and the probability that such 
a path appears is Hence, Ei < p^-i'^^-i'-^ for i < k. For i = k, we have Ef^ = 1. Hence, 

E = maxj>o -Ei < p^n^~^ < \{pn)^^^ and E' = maxi>i E'j < 1. By the Kim-Vu inequality with 
A = (A; + 2) log n, we have 

F[\PUu,v)-Eo\>a',{pn)'/'log'n] = 0(e-3i°s") = ©(n'^), 

where a'^ = {k + 2fak = S^{k + 2)'=\/M. Thus, we get for all pairs (u, v) that with high probability 

Pk{u,v) < EQ + a'^{pnfl^\og^n< h^pnf/^ + a',^{pnYl^\og^n< ^{pny/^ < d^/\ 

To estimate Pk+i{u,v), we use a similar argument. Again, this is a multilinear polynomial 
Pk+i{u,v) = X^pIleeP^e' ^^^^ time of degree k + 1. The expectation is Eq = E[P,t+i(u, ?;)] < p^+^n^. 
Further, we get E^ < p^+^-'^n^-'^ for i < k, Ej^^i = 1 and therefore, E = maxj>o -Ej = Eq. Since our 
choice of k implies that £"0 = (1 — o{l))p''~^^ n'' > {pn)^/^ /b, we also have 



E' 



max Ei 

i>l 



max [p^n'' \ l) < 5Eo/{pn) 



1/4 



By Kim-Vu with X = {k + 2) log n. 



F[\Pk+i{u,v) - Eo\ > a'k^/EE^ log'' n] = 0{e-^'°^^) = 0{n-^) , 

where a'^ = {k + 2)''ak is a constant. Note that a^\/ EE' log ''n< 5a'fclog^n£;o/(pn)V8 = o{Eo). Recall 
also that d = {1 — o{l))pn and t = {1 — o(l))n. Thus, for all pairs (n, v) with high probability 

Pk+iiu,v) <Eo + oiEo) < (1 + o(l))/+in^- < d'^+^/t □ 



Algorithm 4. Start by making k random moves from an arbitrary vertex vq £ V, in each step 
choosing a random neighbor Vi-^-i £ N{vi). Embed the root of the tree r £ T at f{r) = v^. 

As long as T is not completely embedded, take an arbitrary vertex u £ V{T) which is embedded but 
its children are not. If f{u) has enough available neighbors in N{f{u)) unoccupied by other vertices 
of T, embed the children of u one by one by choosing vertices randomly from the available neighbors 
of f{u). Otherwise, fail. 

The following is our main theorem. 
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Theorem 5.2 Let G be a graph on n vertices satisfying property V{d,k,t) for d > log® n, k < logn 
and e,6 > are such that 

(2k€)^/^ + 5 + ^ < 1. (1) 

Then for any tree T of maximum degree at most 5d and size at most et, the algorithm above finds 
embedding of T with high probability. 

This resuh has an interesting consequence aheady for k = 1. Let G be a graph on n vertices with 
minimum degree pn such that every two distinct vertices of G have at most 0{p^n) common neighbors. 
For p ^ there are several known explicit construction of such graphs and their properties were 

extensively studied by various researchers (see, e.g., survey pO] and its references). Our theorem 
implies nearly optimal embedding results for such G and shows that it contains every tree of order 
0(n) with maximum degree Q{pn). 

Considering the extreme values of e and 5 that satisfy ([T]), we obtain embeddings of 

• trees with maximum degree at most a constant fraction of d (e.g., jd) and size 2~^^^H. 

• trees with maximum degree 0{d/k) and size 0{t/k). 

Combining Theorem 15.21 with Proposition 15. 11 we see that for a random graph Gn,p with p = n"'~^ and 
constant a > we can use d ~ pn, t n and k ~ 1/a. Therefore for such p we are embedding trees 
whose size and maximum degree are proportional to the order and minimum degree of Gn^p- This is 
clearly tight up to constant factors. 

Before proving the theorem, we outline the strategy of our proof. Our goal is to argue that there 
is some a > such that no more than ad vertices are ever occupied in any neighborhood N{v), 
including vertices embedded through v itself. Again, we consider the number X^j of vertices in N(v) 
occupied by vertices of T, other than those embedded as children of v. The "bad event" By occurs 
when > d/k and we stop the algorithm immediately after the first such event. At most 6d vertices 
can be embedded as children of v, therefore assuming that no bad event happens, at most {1/k + 5)d 
vertices are eventually occupied in any neighborhood N{v). Since l/k + 6 <l- (2A;e)i/^' by (H}, we 
can set 

a = 1 - {2key/^. 

If no bad even occurs, any vertex of T has at least (1 — a)d choices available for its embedding. If a 
bad event occurs, we can assume that the algorithm fails. 

We estimate the probability of Bjj by studying the random variable X^. The expectation E[Xt,] 
is bounded relatively easily, since this is determined by the number of possible ways that a vertex 
of T can reach the neighborhood N{v). This can be bounded using our property V{d,k,t). The 
more challenging part of the proof is to argue that the probability of By is very small, since the 
contributions from different vertices of the tree are not independent. We handle this issue by dividing 
the contributions into blocks of variables which are effectively independent. We write Xy = X^fLj^ Yy^i 
and use a supermartingale tail estimate to bound each Yy^i. 

The following definitions are similar to those in Section [H 
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Definition 5.3 For a rooted tree T, with a natural top-to-hottom orientation, let Lk-i{x) define the 
set of descendants k — 1 levels down from x £ V{T). 

For a vertex v S V{G), denote by the number of vertices in T that end up embedded in N{v), 
before the children of f^^{v) are embedded. 

For a tree vertex x £ V{T), denote by X^^x the number of vertices in Lk-i{x) that end up embedded 
in N{v), before the children of f^^{v) are embedded. 

As in Section HJ we extend T to a larger tree T* by adding a path of k auxiliary vertices above 
the root. Each embedded vertex y is a (A; — l)-descendant of some x £ V{T*) and hence V{T) = 
UxeV(r*) ^k~iix). By summing up the contributions over x £ V{T*), we get 

Lemma 5.4 Assume G satisfies property V{d,k,t) and fix a tree vertex x £ V{T). Then X^^x is 
bounded by d^^^ with probability 1, and 

nX^^x I T] < (l-a)-'=|Lfc_i(x)|^ 

where T is any fixed embedding of the entire tree T except for the vertex x and its descendants. 

Proof. Assume that conditioned on T, the parent g of x is embedded at f{q) = w £ V{G). The only 
way that a vertex y £ Lk~i{x) can end up in N{v) (but not through v) is when some branch of the 
tree T from g to y is embedded in a path of length k from w to N{v), avoiding v. Such paths can be 
extended uniquely to paths of length A; + 1 from w to v. We know that the number of such paths is 
bounded by Pk+i{w,v) < d'^^^/t. 

Since there are at least (1 — a)d choices when we embed each vertex, the probability of following 
a particular path of length k is at most By the union bound, the probability that y is 

embedded in N{v) is 

P[/(y) £ Niv) I T] < ^ ,,,, Pk+iiw, v)< 



Finally, 

Similarly, the number of paths of length k — 1 from any vertex u to N(v), avoiding v, is the same 
as the number Pk{u, v) of paths of length k from u to v. Even if all these Pk{u, v) paths are used in the 
embedding of T, the vertices in Lk-i{x) cannot occupy more than Pk{u,v) neighbors off. Therefore, 
we can always bound X^^x < Pk{u,v) < d^^'^. □ 

Next, we want to argue about the concentration of X^ = Y2x<^v{T*) -Xv,x- Since the placements of 
different vertices in T are highly correlated, it is not clear whether any concentration result applies 
directly to this sum. However, we can circumvent this obstacle by partitioning V{T*) into subsets 
where the dependencies can work only in our favor. 
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Definition 5.5 Let r* be the root of T* , then every vertex of T* is in Lj{r*) for some j. Define a 
partition V{T*) = U VFi U . . . U Wk-i by 

j'=j (mod A:) 

For each vertex v G y{G) and < j < k, define 

xeWj 

Obviously, we have Xy = ^^^eViT*) -^■>',^ ~ Sj=o -^^j- following, we argue that each Y^j 

has a very small one-sided tail. 

Lemma 5.6 Let ij = ExeVF \Lk-iix)\. Then E%j] < (1 - a)-^^fd and 



n,>(l-ar^(| + -)d 



Proof. By Lemma [5.4[ we know that E[Xt,^j: \ T] < {1 — where T is any fixed 

embedding of T except x and its subtree. Therefore, the same also holds without any conditioning. 
By taking a sum over all x G Wj, 

E[Y,j] = E[X,,,] < (1 - \Lk-i{x)\^ = (1 - 

xeWj xeWj 

For a tail estimate, we use Proposition ll.il Write the vertices of Wj = {xi,X2, ■ ■ ■ , Xr} in order as they 
are embedded by the algorithm and write Xi = d~^/^Xy^Xi- The important observation is that the 
values of Xi,X2, . . . , Xi-i are determined if we are given the embedding of the tree T except for the 
vertex Xi and its subtree (let's denote this condition by 7^). This holds because Xi, . . . depend 
only on the embedding of vertices xi, . . . , Xi-i and their subtrees of depth k — 1. Since all these vertices 
are either at least k levels above Xi in the tree T, or on the same level or below (but not in the subtree of 
Xi), their subtrees of depth k—1 are disjoint from the subtree of Xj. Hence, conditioning on % is stronger 
than conditioning on Xi, . . .,Xi^i. Since E[Xi \ %\ = d'^/^ElX^^^^ | < (1 - a)-''\Lk-i{xi)\^, 
we can also write 

E[Xi I < (l-a)-'^|Lfc_i(x,)| — . 

The range of is [0, d^/^], hence G [0,1]. Summing over VFj, we have < {l-a)-''£j'^, 

so let's set ^ = (1 - a)~''£j^-^. By Proposition ll.il 

¥[YXi> {l + e')fi]<e-'^. 
Using that £j < \T\ < et, for e' = j^, we get 



[Yx,>^^+'-{l-ar'd^/' 



Since Y^uj = d^f'^ ^ Xi, this proves the claim of the lemma. □ 
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Lemma 5.7 Let denote the "had event" that > d/k. Assuming that ^ holds, and \T\ < et, 
then for any fixed vertex v the had event happens with prohahility 

F[B^] Kke"^. 



Proof. We have Xy = Yl'i=o ^-"d- Recall that (1 — a)^ = 2ke. By Lemma [5^ 



Yyj > (1 - a 



'"'In'" 



^ g 3fe2(l_c,)': 



_d3/4 

e 6fc3 



for each j = 0, 1, 2, . . . , A; — 1. By the union bound, the probability that any of these events happens 
is at most ke~'^^^* /^^^ . If none of them happen, we have 



fc-i 

Xv = < (1 

j=0 



a) 



i=o ^ ^ 



a) 



— + e]d< [l-a)-^ ■2ed=^. 
t I k 



□ 



To finish the proof of Theorem 15.21 we note that d > log® n and k < log n. The probabilities of 
bad events are bounded by ke~'^^^'^/^^^ < (logn) e~6'°s " < l/n^°^". There are n potential bad 
events, so none of them occurs with high probability. 



6 Concluding remarks 

In this paper we have shown that a very simple randomized algorithm can find efficiently tree em- 
beddings with near-optimal parameters, surpassing some previous results achieved by more involved 
approaches. Here are few natural questions which remain open. 

• It would be interesting to extend our results from graphs of girth 2A; -|- 1 to graphs without cycles 
of length 2k. For = 3, this follows from our work combined with a result of Gyori. In [13] he 
proved that every bipartite Ce-free graph can be made also C4-free by deleting at most half of its 
edges. Therefore given a Cg-free graph with minimum degree d, we can first take its maximum 
bipartite subgraph. This will decrease the number of edges by at most factor of two. Then we 
can use the above mentioned result of Gyori to obtain a C4-free and Cg-free graph which has at 
least a quarter of the original edges, i.e., average degree at least (i/4. In this graph we can find a 
subgraph where the minimum degree is at least d/S (1/2 of average degree). Since it is bipartite, 
this subgraph has no cycles of length shorter than 7. This shows that every Cg-free graph G 
with minimum degree d contains a subgraph G' of girth at least 7 whose minimum degree is a 
constant fraction of d. Using our result, we can embed in G' (and hence also in G) every tree of 
size 0{d'^) and maximum degree 0{d). 

More generally, it is proved in [19] that any C2/c-free graph contains a C4-free subgraph with at 
least 2p3iy -fraction of its original edges. Moreover it is conjectured in [19], that any C2A;-free 
graph contains a subgraph of girth 2k + 1 with at least an e^-fraction of the edges. If this 
conjecture is true, it shows that the tree embedding problems for C2/fc-free graphs and graphs of 
girth 2fc -|- 1 are equivalent up to constant factors. 
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• For random graphs Gn,p our approach works most efficiently when the edge probability p = n""^ 
for some constant a > 0. Nevertheless, it can be used to embed trees in sparser random graphs 
as well. By analyzing more carefully the application of the Kim-Vu inequality, one can show that 
for every fixed e > 0, a random graph with edge probability p > e'°s^^^^^ "/n satisfies 7^(cZ, k, n/2) 
with d c::^ pn and k ~ log^ n. However, when p = n~^~^°^^^ we have k ^ oo and therefore both the 
maximum degree an the size of the tree we can embed are only an o(l)-fraction of the optimum. 
It would be extremely interesting to show that for edge probability p = n~^^°^^\ perhaps even 
p = c/n for some large constant c > 0, the random graph Gn,p still contains every tree with 
maximum degree 0{pn) and size 0(n). 

It would be also nice to weaken our pseudorandomness property V{d,k,t) which is defined in 
terms of numbers of paths between pairs of vertices. The most common definition of pseudoran- 
domness is in terms of edge density between subsets of vertices of a graph. In particular, it would 
be interesting to extend our results to embedding of trees in graphs whose edge distribution is 
close to that of random graph. 

• Finally, we wonder if there are any additional interesting families of graphs for which one can show 
that our simple randomized algorithm succeeds to embed trees with nearly optimal parameters. 
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